Controlling cameras in sport events

ABSTRACT

A method for controlling cameras in a sport event, the method comprising steps executed by at least one computer, the steps comprising: during a sport event taking place in a constrained environment, receiving images of the sport event, the images being captured with a first pixels to time ratio, tracking a motion of a first object, using the images being captured with the first pixels to time ratio, detecting an arrival of the first object into a predefined positional relation to a second object during the tracked motion, and upon the detected arrival, initiating streaming of images being captured with a pixels to time ratio higher than the first pixels to time ratio by at least one selected camera.

FIELD AND BACKGROUND OF THE INVENTION

The present invention relates to controlling cameras and, moreparticularly, but not exclusively to controlling cameras in sportevents.

Numerous popular sport events such as soccer, basketball, cricket,tennis or baseball, are played globally. When played, many of the gamesare broadcasted, to be shown as live telecasts to spectators, on TVchannels, web sites, etc.

Usually, videos of most of the sports events are generated in asubstantially manual manner, and require many people to be employed, saycameramen that continuously capture motion pictures of different regionsof a court area, as well as of corresponding activities of differentplayers involved in the sports events.

For example, during video filming of a soccer match played on a soccerfield, different cameramen may be active in different regions around thefield, and continuously capture video images of players running, kickinga ball, etc., during the soccer match. Usually, the cameramen areplaced, supervised, and directed by television directors, as known inthe art.

Some cameramen may be dedicated to following a ball continuously, as thematch progresses.

Specifically and additionally, there may also be a cameraman who isdedicated to identifying and capturing most interesting events duringthe match—such as a goal, an in/out event, etc., which events areusually of interest to spectators, as well as to officials such as linejudges and side judges.

The cameraman who is dedicated to identifying and capturing the mostinteresting events, may use a camera of higher qualities, say a widerzooming range, a camera with better lenses, etc. However, the qualityand timing of capturing of the interesting events depends on the skills,experience, sense of timing, etc., of the dedicated cameraman and/or adirector supervising the cameraman, during the sport event's filming.

Thus, the broadcasting of sport events in general, and theidentification of events of interest (goals, in/outs, etc.) during sportevents, in particular, remain processes that depend on substantiallymanual management and control of cameras and cameramen.

SUMMARY OF THE INVENTION

According to one aspect of the present invention, there is provided amethod for controlling cameras in a sport event, the method comprisingsteps executed by at least one computer, the steps comprising: during asport event taking place in a constrained environment, receiving imagesof the sport event, the images being captured with a first pixels totime ratio, tracking a motion of a first object, using the images beingcaptured with the first pixels to time ratio, detecting an arrival ofthe first object into a predefined positional relation to a secondobject during the tracked motion, and upon the detected arrival,initiating streaming of images being captured with a pixels to timeratio higher than the first pixels to time ratio by at least oneselected camera.

According to a second aspect of the present invention, there is providedan apparatus for controlling cameras in a sport event, the apparatuscomprising: a computer, an image receiver, implemented on the computer,and configured to receive images of a sport event taking place in aconstrained environment, during the sport event, the images beingcaptured with a first pixels to time ratio, a motion tracker, incommunication with the image receiver, configured to track a motion of afirst object, using the images being captured with the first pixels totime ratio, a positional relation detector, in communication with themotion tracker, configured to detect an arrival of the first object intoa predefined positional relation to a second object during the trackedmotion, and a camera controller, in communication with the positionalrelation detector, configured to initiate streaming of images beingcaptured with a pixels to time ratio higher than the first pixels totime ratio by at least one selected camera, upon the detected arrival.

According to a third aspect of the present invention, there is provideda non-transitory computer readable medium storing computer executableinstructions for performing steps of controlling cameras in a sportevent, the steps comprising: during a sport event taking place in aconstrained environment, receiving images of the sport event, the imagesbeing captured with a first pixels to time ratio, tracking a motion of afirst object, using the images being captured with the first pixels totime ratio, detecting an arrival of the first object into a predefinedpositional relation to a second object during the tracked motion, andupon the detected arrival, initiating streaming of images being capturedwith a pixels to time ratio higher than the first pixels to time ratioby at least one selected camera.

Unless otherwise defined, all technical and scientific terms used hereinhave the same meaning as commonly understood by one of ordinary skill inthe art to which this invention belongs. The materials, methods, andexamples provided herein are illustrative only and not intended to belimiting.

Implementation of the method and system of the present inventioninvolves performing or completing certain selected tasks or stepsmanually, automatically, or a combination thereof.

Moreover, according to actual instrumentation and equipment of preferredembodiments of the method and system of the present invention, severalselected steps could be implemented by hardware or by software on anyoperating system of any firmware or a combination thereof.

For example, as hardware, selected steps of the invention could beimplemented as a chip or a circuit. As software, selected steps of theinvention could be implemented as a plurality of software instructionsbeing executed by a computer using any suitable operating system.

In any case, selected steps of the method and system of the inventioncould be described as being performed by a data processor, such as acomputing platform for executing a plurality of instructions.

BRIEF DESCRIPTION OF THE DRAWINGS

The invention is herein described, by way of example only, withreference to the accompanying drawings. With specific reference now tothe drawings in detail, it is stressed that the particulars shown are byway of example and for purposes of illustrative discussion of thepreferred embodiments of the present invention only, and are presentedin order to provide what is believed to be the most useful and readilyunderstood description of the principles and conceptual aspects of theinvention. The description taken with the drawings making apparent tothose skilled in the art how the several forms of the invention may beembodied in practice.

In the drawings:

FIG. 1 is a block diagram schematically illustrating an exemplaryapparatus, for controlling cameras in a sport event, according to anexemplary embodiment of the present invention.

FIG. 2 is a flowchart schematically illustrating an exemplary method,for controlling cameras in a sport event, according to an exemplaryembodiment of the present invention.

FIG. 3A is a block diagram schematically illustrating a first exemplaryscenario of controlling cameras in a sport event, according to anexemplary embodiment of the present invention.

FIG. 3B is a block diagram schematically illustrating a second exemplaryscenario of controlling cameras in a sport event, according to anexemplary embodiment of the present invention.

FIG. 4 is a block diagram schematically illustrating an exemplarycomputer readable medium storing computer executable instructions forperforming steps of controlling cameras in a sport event, according toan exemplary embodiment of the present invention.

FIG. 5 is a top 3D perspective diagram of a system incorporated in areal sport environment, according to an exemplary embodiment of thepresent invention.

FIG. 6 is a flowchart illustrating activity of an event predictionmodule, according to an exemplary embodiment of the present invention.

FIG. 7 is a flowchart illustrating activity of high frequency frameanalysis module, according to an exemplary embodiment of the presentinvention.

FIG. 8 is a flowchart illustrating activity of embedding frames in amultimedia stream module, according to an exemplary embodiment of thepresent invention.

DESCRIPTION OF THE PREFERRED EMBODIMENTS

The present embodiments comprise an apparatus and a method forcontrolling cameras in a sport event.

Every day, numerous popular sport events such as soccer matches,basketball games, cricket games, tennis matches or baseball games, areplayed globally. When played, many of those sport events arebroadcasted, to be shown as live telecasts to spectators, on TVchannels, web sites, etc.

Usually, videos of most of the sports events are generated in asubstantially manual manner, which requires many people to be employed,for example cameramen that continuously capture motion pictures ofdifferent regions of a court area, and of corresponding activities ofdifferent players involved in the sports events. Further employed arecameramen dedicated to identifying and capturing most interesting eventsduring the match—such as a goal, an in/out event, etc., which events areusually of interest to spectators, as well as to officials such as linejudges.

According to some embodiment of the present invention, during a sportevent taking place in a constrained environment (say a tennis court or afootball field), there are received images of the sport event, in realtime, as the images are being captured with a first pixels to timeratio. The received images captured with the first pixels to time ratio,are used to track a first object's motion (say a tennis ball's or asoccer player's motion.

Upon automatic detection of an arrival of the first object (say thetennis ball) into a predefined positional relation to a second object(say to a volleyball net, to a soccer gate, or to a boundary line on atennis court), there is automatically initiated a streaming of imagesbeing captured with a pixels to time ratio higher than the first pixelsto time ratio—say in a higher image capture frequency, in a higher imageresolution, or with both the image capture frequency and the imageresolution being higher, by one or more camera(s).

In one example, a tennis ball's motion, away from a player who servesthe ball to an opponent, is tracked using images captured by cameraspositioned over corners of a tennis court with a first pixels to timeratio (say in a first image capture frequency and resolution), asdescribed in further detail hereinbelow.

Optionally, the motion is tracked through a stereoscopic or otheranalysis of the images captured with the first pixels to time ratio,which analysis yields a three dimensional (3D) space which representsthe constrained environment, as described in further detail hereinbelow.

In the example, when the ball's distance from a net positioned in themiddle of the court closes into below a predefined threshold, during theball's tracked motion away from the player, there is initiated astreaming of images being captured with a pixels to time ratio higherthan the first pixels to time ratio.

The images are captured with the higher pixels to time ratio by one ormore selected cameras, say by cameras of a higher capture frequency, ahigher resolution, or by cameras of both a higher resolution and ahigher image capture frequency, which cameras are positioned at two endsof the net, as described in further detail hereinbelow.

Thus, according to some embodiments, an event of interest (say an outevent, a net event, etc.) may be captured in the images captured withthe higher pixels to time ratio, which images are automatically streamedupon the detection of the arrival of the first object into thepredefined positional relation, during the tracked motion of the firstobject.

With the initiation of the streaming being automatically (rather thanmanually) initiated upon the automatic detection of the first object'sarrival into the predefined positional relation to the second object,the event of interest (say a goal or an out event) is much less likelyto be missed. Further, the event of interest is much less likely to becaptured with a lower, and thus less informative, pixels to time ratio.

Thus, in a first example, during a tennis match, a ball's motion may betracked, through an analysis of images captured in a resolution of2048×1080 pixels, say using 2K digital cameras.

When the ball arrives at a predefined positional relation to aborderline, say into a distance of less than two meters from theborderline, there is automatically initiated a streaming of imagescaptured in a resolution of 15360×8640 pixels, say by one or moreselected 16K digital camera(s) positioned next to the borderline.

Consequently, a position of the ball may be more accurately determined,as described in further detail hereinbelow.

In a second example, during a tennis match, a ball's motion may betracked, through an analysis of images captured during the match, in a50 Hz image capture frequency—i.e. with an accuracy of 20 milliseconds.

When the ball arrives at a predefined positional relation to aborderline, say into a distance of less than two meters from theborderline, there is automatically initiated a streaming of imagescaptured in a 1000 Hz image capture frequency—i.e. with an accuracy ofone millisecond—by one or more selected cameras.

Optionally, the selected camera is a camera closest to a segment of theborderline which the ball's trajectory seems to lead to, as described infurther detail hereinbelow.

Consequently, there is enabled, a capturing of the image of the ball atthe exact millisecond of the ball's landing, which capturing is muchlikely to occur with the capture rate of 50 Hz, having the accuracy of20 milliseconds only.

The improved accuracy resultant upon the higher pixels to time ratio,may allow a broadcaster, to provide spectators of a sport event, say ona TV channel or web site, with a close up video which clearly shows theevent of interest, say the out event as captured in the higher pixels totime ratio. Further, the close up video is likely to remove doubts as toa decision, say of a line judge.

The improved accuracy which is resultant upon the higher pixels to timeratio, may also make the tracking of the ball's motion during the eventof interest more accurate, and provide for more accuratecharacterization the event, as described in further detail hereinbelow.

Thus, according to an exemplary embodiment, during the tennis match,interesting events, such as an out event, are captured with the higherpixels to time ratio—which pixels to time ratio provides for greateraccuracy, but involves the processing of a greater number of pixels perminute (i.e. of a higher volume of data), and is thus likely to beheavier in terms of bandwidth consumed by the streamed images, datastorage, etc.

However, the images of the remaining parts of the sport event arecaptured and received in the lower pixels to time ratio—which is lessaccurate, but lighter in terms of bandwidth consumption, data storage,etc.

Consequently, there may be provided a potentially optimal usage of thecameras of different pixels to time ratios (i.e. cameras which differ inimage capture frequency, image capture resolution, or in both imagecapture frequency and image capture resolution, as described in furtherdetail hereinabove.

The principles and operation of a method and an apparatus according tothe present invention may be better understood with reference to thedrawings and accompanying description.

Before explaining at least one embodiment of the invention in detail, itis to be understood that the invention is not limited in its applicationto the details of construction and the arrangement of the components setforth in the following description or illustrated in the drawings.

The invention is capable of other embodiments or of being practiced orcarried out in various ways. Also, it is to be understood that thephraseology and terminology employed herein is for the purpose ofdescription and should not be regarded as limiting.

Reference is now made to FIG. 1, which is a block diagram schematicallyillustrating an exemplary apparatus, for controlling cameras in a sportevent, according to an exemplary embodiment of the present invention.

An apparatus 10 for controlling cameras in a sport event, according toan exemplary embodiment of the present invention, includes a computer.The computer may be a single computer or a group of computers incommunication over a network.

The apparatus 10 further includes one or more additional parts 11-14.The parts 11-14 may be implemented as software, as hardware, or as acombination of hardware and software, on the computer, as described infurther detail hereinbelow.

The apparatus 10 communicates with one or more cameras, for receivingimages, for initiating a streaming of images, etc., as described infurther detail hereinbelow.

In one example, the apparatus 10 communicates with cameras positionedaround a constrained environment (say a football field or a tenniscourt) in which a sport event takes place, and with cameras positionedover specific parts of the constrained environment.

The apparatus 10 may control one of more of the cameras, for receivingimages captured with a first pixels to time ratio—say in a firstfrequency and resolution, from one or more of the cameras, for receivingimages captured in a higher pixels to time ratio, from the same or othercamera(s), etc., as described in further detail hereinbelow.

The apparatus 10 includes an image receiver 11, implemented on thecomputer.

The image receiver 11 receives images of a sport event taking place in aconstrained environment (say a football field, a soccer field, a tenniscourt, etc.), during the sport event.

The received images are captured with a first pixels to time ratio, sayin a first image capture frequency and resolution, say by one or more ofthe cameras positioned around the constrained environment, as describedin further detail hereinbelow.

The apparatus 10 further includes a motion tracker 12, in communicationwith the image receiver 11.

The motion tracker 12 tracks a motion of a first object (say a ball or aplayer), using the images which are captured in the first pixels to timeratio, during the sport event, in real time.

Optionally, the motion tracker 12 tracks the motion through astereoscopic or other analysis of the images captured in the firstpixels to time ratio, which analysis yields a three dimensional (3D)space which represents the constrained environment, and objects inmotion over the constrained environment, as known in the art.

The apparatus 10 further includes a positional relation detector 13, incommunication with the motion tracker 12.

The positional relation detector 13 detects an arrival of the firstobject (say the ball) into a predefined positional relation to a secondobject (say a borderline, a tennis net, a soccer gate, etc.), during themotion tracked by the motion tracker 12.

Optionally, the positional relation detector 13 bases the detecting ofthe arrival of the first object into the predefined positional relationto a second object, at least on a predefined change in shape of one ofthe objects (say a squeezing of the ball when the ball hits a racket ora wall, a change in shape of a tennis net when hit by the ball, etc.).

Optionally, the positional relation detector 13 bases the detection ofthe arrival of the first object into the predefined positional relationto a second object, at least on a distance between the objects.

In one example, the positional relation detector 13 detects an arrivalof a ball at a distance of below a predefined threshold from a netpositioned in the middle of a constrained environment such as a tenniscourt, or from a borderline, during the ball's motion away from a playerwho strikes the ball with a racket, as tracked by the motion tracker 12.

In another example, the positional relation detector 13 detects anarrival of a ball at a distance of above a predefined threshold from aplayer, during the ball's motion away from a player who kicks the ball,as tracked by the motion tracker 12.

In yet another example, the positional relation detector 13 detects anarrival of a player at a distance of below a predefined threshold from afootball gate during the player's motion, as tracked by the motiontracker 12.

The apparatus 10 further includes a camera controller 14, incommunication with the positional relation detector 13.

The camera controller 14 initiates a streaming of images being capturedwith a pixels to time ratio higher than the first pixels to time ratio,say in a higher frequency, in a higher resolution, or in both a higherfrequency and a higher resolution, by one or more selected camera(s),upon the arrival detected by the positional relation detector 13.

Optionally, the apparatus 10 further includes a camera selector (notshown), in communication with the camera controller 14.

The camera selector selects the one or more camera(s) which capture(s)the images in the higher pixels to time ratio, among a set whichconsists of two or more cameras, based on a rule predefined by anoperator, an administrator, a developer of the apparatus 10, etc.

Optionally, the camera selector selects the one or more camera(s), basedon a trajectory of the tracked motion of the first object and on thefield of view of the selected camera(s), as described in further detailhereinbelow.

Each of the cameras in the set has a different field of view, which maycover a respective, different part of the constrained environment, sayan area around a net deployed on a volleyball court, a specific segmentof a borderline of a tennis court, an area around a soccer gate, etc.

Optionally, the selected cameras include two or more cameras deployed atdifferent sides of a same part of the constrained environment (say onthe two ends of the net, or of a borderline segment), with a different(say opposite) field of view on the same part, as described in furtherdetail hereinbelow.

Optionally, images captured by the two (or more) selected camerasdeployed at different sides of a same part, are stereoscopicallyanalyzed, to yield a three dimensional (3D) space which represents thepart of the constrained environment covered by the selected cameras andobjects in motion over that covered part, as described in further detailhereinbelow.

Optionally, upon the detection of the arrival of the first object intothe predefined positional relation, the camera controller 14 furtherinitiates a change in field of view of the selected camera(s), say bytriggering an actuator.

Consequently, the actuator moves the selected camera, in a rotationalmotion or in a motion which involves a change of tilt, so as to capturea part of the constrained environment, approached by the first object.

In one example, the selected camera is a camera also used for capturingat least some of the images captured with the first pixels to time ratioand received by the image receiver 11, and the camera controller 14changes the selected camera's pixels to time ratio to the higher pixelsto time ratio, upon the detected arrival. That is to say that in theexample, the same camera is used to capture images with both pixels totime ratios, as described in further detail hereinbelow.

Optionally, the camera controller 14 further stops the streaming of theimages which are captured with the pixels to time ratio higher than thefirst pixels to time ratio, from the selected cameras, according to apredefined rule.

In one example, the camera controller 14 stops the streaming after apredefined number of seconds following the detected arrival into thepositional relation, upon a subsequent detection of an end of the event(say a crossing of the ball into an area within a soccer gate, a fewseconds after that crossing, etc.), as described in further detailhereinbelow.

Optionally, the apparatus 10 further includes an image combiner (notshown), in communication with the camera controller 14.

The image combiner combines at least a few of the images captured withthe first pixels to time ratio and at least a few of the images capturedwith the higher pixels to time ratio, for creating an effect of zoominginto a part of the constrained environment covered by a field of view ofthe selected camera(s).

The improved accuracy resultant upon the higher pixels to time ratio,may thus allow a broadcaster, to provide spectators of a sport event,say on a TV channel or a website, with a close up video which creates aneffect of zooming into the part of the constrained environment coveredby the field of view of the selected camera(s).

Consequently, there may be clearly shown the event of interest, say anout event captured in the higher pixels to time ratio, in the close upvideo. The video may thus potentially remove doubts as to properofficiating (say as to a decision made by an official such as a linejudge during a football match or a tennis match).

In one example, the apparatus 10 further includes two or more lowfrequency (LF) cameras, i.e. cameras of a low image capture frequency(say of 25 Hz), positioned around the constrained environment, say fourLF cameras positioned roughly at the four corners of a tennis or asoccer field. The LF cameras capture the images in the low frequency,during the sport event, and forward the captured images to imagereceiver 11, as described in further detail hereinabove, and asillustrated using FIG. 3A and FIG. 3B.

In the example, the apparatus 10 further includes several high frequency(HF) cameras, i.e. cameras of a high image capture frequency (say of2,000 Hz), positioned at different parts of the constrained environment.Each the HF cameras has a different field of view, which covers arespective, different part of the constrained environment, say an areaaround a net, a specific segment of a borderline of a football field ora soccer field, etc., as described in further detail hereinbelow.

Upon the detection of the first object's arrival into the positionalrelation, as detected by the positional relation detector 13, using theimages captured by the LF cameras in the low frequency, the imagescaptured by the HF cameras in the high frequency, are streamed to theapparatus 10, in real time (or in near real time), as described infurther detail hereinabove, and as illustrated using FIG. 3A and FIG.3B.

Reference is now made to FIG. 2, which is flowchart schematicallyillustrating an exemplary method, for controlling cameras in a sportevent, according to an exemplary embodiment of the present invention.

A first exemplary method for controlling cameras in a sport event,according to an exemplary embodiment of the present invention, may beexecuted by a computer. The computer may be a single computer or a groupof computers in communication over a network.

For carrying out the method, the computer communicates with one or morecameras, through the internet, an intranet network, or local areanetwork, another network, or any combination thereof, for receivingimages, initiating a streaming of images from one or more cameras,stopping a streaming of images from one or more cameras, etc., asdescribed in further detail hereinbelow.

In one example, the cameras are positioned around a constrainedenvironment (say a football field or a tennis court) in which a sportevent takes place, and over specific parts of the constrainedenvironment, as described in further detail hereinbelow.

In the method, there are received 21 images of a sport event takingplace in a constrained environment (say a football field, a soccerfield, a tennis court, etc.), during the sport event, say by the imagereceiver 11, as described in further detail hereinabove.

The received 21 images are captured with a first pixels to time ratio,say in a first image capture frequency and a first resolution, say byone or more of the cameras positioned around the constrainedenvironment, as described in further detail hereinbelow.

During the receiving 21, there is tracked 22 a motion of a first object(say a ball or a player), using the received 21 images which arecaptured with the first pixels to time ratio, during the sport event, inreal time (or in near real time), say by the motion tracker 12, asdescribed in further detail hereinabove.

Optionally, the motion is tracked 22 through a stereoscopic or otheranalysis of the images captured with the first pixels to time ratio,which analysis yields a three dimensional (3D) space which representsthe constrained environment and objects in movement over the constrainedenvironment, as known in the art.

When the first object (say the ball) arrives into a predefinedpositional relation to a second object (say a tennis net, a soccer gate,etc.), during the tracked 22 motion, in real time (or near real time),the arrival of the first object into the positional relation to a secondobject is detected 23, say by the positional relation detector 13, asdescribed in further detail hereinabove.

Optionally, the detection 23 is based at least on a predefined change inshape of one of the objects (say a squeezing of the ball when the ballhits a racket or a wall, or a change in shape of a tennis net hit by theball).

Optionally, the detection 23 is based at least on a distance between theobjects. In one example, there is detected 23 an arrival of a ball at adistance of below a predefined threshold from a net positioned in themiddle of a constrained environment such as a tennis court, or from aborderline, during the ball's motion away from a player who strikes theball with a racket, as tracked by the motion tracker 12.

In another example, there is detected 23 an arrival of a ball at adistance of above a predefined threshold from a player who kicks theball, during the ball's motion away from the player, as tracked by themotion tracker 12.

In yet another example, there is detected 23 an arrival of a player at adistance of below a predefined threshold from a football gate, duringthe player's motion, as tracked by the motion tracker 12.

Upon the detection 23 of the arrival into the positional relation, thereis automatically initiated 24 a streaming of images being captured in apixels to time ratio higher than the first pixels to time ratio, say ina higher image capture frequency, in a higher image resolution, or inboth a higher image capture frequency and a higher image resolution, byone or more selected camera(s).

Optionally, the one or more camera(s) are selected, say by the cameraselector of apparatus 10, among a set which consists of two or morecameras, as described in further detail hereinabove.

Optionally, the selection is based on a rule predefined by an operator,an administrator, a developer of the apparatus 10, etc.

Optionally, the one or more camera(s) are selected based on a trajectoryof the tracked 22 motion of the first object and on the field of view ofthe selected camera(s), as described in further detail hereinbelow.

Each of the cameras in the set has a different field of view, which maycover a respective, different part of the constrained environment, sayan area around a net deployed on a volleyball court, a specific segmentof a borderline of a tennis court, an area around a soccer gate, etc.

Optionally, the selected cameras include two or more cameras deployed atdifferent sides of a same part of the constrained environment (say onthe two ends of the net, or of a borderline segment), with a different(say opposite) field of view on the same part, as described in furtherdetail hereinbelow.

Optionally, images captured by the two (or more) selected camerasdeployed at different sides of a same part, are stereoscopicallyanalyzed, to yield a three dimensional (3D) space which represents thepart of the constrained environment covered by the selected cameras andobjects in motion over that covered part, as described in further detailhereinbelow.

Optionally, there is further initiated a change in field of view of theselected camera(s), say by triggering an actuator, say by the cameracontroller 14, as described in further detail hereinabove.

Consequently, the actuator moves the selected camera, in a rotationalmotion or in a motion which involves a change of tilt, so as to capturea part of the constrained environment, approached by the first object.

In one example, the selected camera is a camera also used to capture atleast some of the received 21 images captured with the first pixels totime ratio, and the selected camera's pixels to time ratio is changed tothe higher pixels to time ratio (say to a mode with a higher imagecapture frequency, with a higher image capture resolution, or withboth), upon the detected 23 arrival, say by the camera controller 14.That is to say that in the example, the same camera is used to captureimages with both pixels to time ratios.

Optionally and subsequently, the streaming of the images captured withthe pixels to time ratio higher than the first frequency pixels to timeratio, from the selected cameras, is stopped according to a predefinedrule, say by the camera controller 14, as described in further detailhereinabove.

In one example, the camera controller 14 stops the streaming after apredefined number of seconds after the detected 23 arrival into thepositional relation, upon a subsequent detection of an end of the event(say a crossing of the ball into an area within a soccer gate, or a fewseconds after that crossing, etc), say by the positional relationdetector 13, as described in further detail hereinbelow.

Optionally, at least a few of the images captured with the first pixelsto time ratio and at least a few of the images captured with the higherpixels to time ratio, are combined, say by the image combiner ofapparatus 10, for creating an effect of zooming into a part of theconstrained environment covered by a field of view of the selectedcamera(s).

The improved accuracy resultant upon the higher pixels to time ratio,may thus allow a broadcaster, to provide spectators of a sport event,say on a TV channel or a web site, with a close up video which createsan effect of zooming into the part of the constrained environmentcovered by the field of view of the selected camera(s). Consequently,there may be clearly shown the event of interest, say an out event,captured with the higher pixels to time ratio, in the close up video.The video may thus potentially remove doubts as to proper officiating(say as to a decision made by an official such as a line judge during afootball match or a tennis match).

Throughout the sport event, there is tracked 22 the motion of objects inthe constrained environment, say by the motion tracker 12.

Optionally, the tracking 22 of the motion of objects (say the firstobject) is based on a derivation of a position of the objects in athree-dimensional space representing the constrained environment fromboth the images captured with the first pixels to time ratio and theimages captured with the higher pixels to time ratio.

In one example, during the streaming 24 of the images captured with thehigher pixels to time ratio, the derivation of the position of the firstobject using the images captured with the higher pixels to time ratioand the tracking 22 (say by the motion tracker 12), are based on astereoscopic analysis of images captured with the higher pixels to timeratio. The images are captured with the higher pixels to time ratio, bytwo selected cameras positioned at opposite sides of the part of theconstrained environment, which part is covered by a field of view of theselected cameras.

In the one example, when the images received 21 (say by the imagereceiver 11) are of a low pixels to time ratio, say a low resolution ora low frequency of capturing, the derivation of the position of thefirst object, and the tracking 22 (say by the motion tracker 12), arerather based on the received 21 images captured with the first pixels totime ratio, as described in further detail hereinabove.

Alternatively, the tracking of the motion of objects (say the firstobject) is based on a derivation of the position of the first objectfrom the images captured only with the first pixels to time ratio.

In one example, the streamed images are captured with the higher pixelsto time ratio, by a single camera which covers the part of theconstrained environment approached by the first object, and the streamedimages captured with the higher pixels to time ratio, are used forgenerating the close up video, but not for the derivation of theposition of the first object, and not for the tracking 22 of the motion.

In one example, the received 21 images are images captured during asport event, by two or more low frequency (LF) cameras, i.e. cameras ofa low image capture frequency (say of 25 Hz), positioned around theconstrained environment, say by four LF cameras positioned roughly overthe four corners of a tennis court or a soccer field, as described infurther detail hereinabove.

The LF cameras capture the images received 21 in real time, as describedin further detail hereinabove, and as illustrated using FIG. 3A and FIG.3B.

In the example, there are further employed several high frequency (HF)cameras, i.e. cameras of a high image capture frequency (say of 2,000Hz), positioned at different parts of the constrained environment. Eachthe HF cameras has a different field of view, which covers a respective,different part of the constrained environment, say an area around a net,a specific segment of a borderline of a football field or a soccerfield, etc., as described in further detail hereinbelow.

Upon the detection 23 of the first object's arrival into the positionalrelation, images captured by the HF cameras in high frequency, arestreamed 24 in real time (or near real time), say to the apparatus 10,as described in further detail hereinabove, and as illustrated usingFIG. 3A and FIG. 3B.

Thus, with the exemplary embodiment, on the one hand, interesting events(such as an out event) are captured and streamed 24 with the higherpixels to time ratio, say in the higher frequency and/orresolution—which provides for greater accuracy.

Indeed, with an availability of a greater number of images per minute(i.e. with a higher image capture frequency), the exact moment in whichan event (say a landing of a ball) occurs, is more likely to becaptured.

Further, the tracking 22 of the first object's (say the ball's) motion,say using the three dimensional (3D) space, may involve a calculationaimed at deriving the path taken by the first object during the timeperiod in between each pair of consecutive images of the first object ascaptured, thus “bridging” between the images, for deriving thetrajectory of the first object.

With the higher image capture frequency, an average path taken by thefirst object in the time periods between two consecutive images of thefirst object is shorter than with the first image capture frequency,since with the higher image capture frequency, the time period inbetween each pair of consecutive images is by definition, shorter thanwith the first, lower image capture frequency.

Consequently, the calculation aimed at deriving the paths are likely tobe simpler, and computationally lighter.

Further, with an availability of a greater number of pixels per image(i.e. with a higher image capture resolution), an exact position of theobjects (say the ball), is more likely to be determined, since eachpixel represents a smaller area of the constrained environment.

However, the higher accuracy involves the processing of a greater numberof pixels per time, and may thus be heavier in terms of bandwidthconsumption, data storage, etc.

On the other hand, the images of the remaining parts of the sport eventare captured and received with the lower pixels to time ratio—which isless accurate, but lighter in terms of bandwidth consumption, datastorage, etc.

Consequently, there may be provided a potentially optimal usage ofcameras of different pixels to time ratios (say different image capturefrequencies different image capture resolutions, or both), say during alive broadcast of the sport event on a TV channel or over the internet.

Reference is now made to FIG. 3A which is a block diagram schematicallyillustrating a first exemplary scenario of controlling cameras in asport event, according to an exemplary embodiment of the presentinvention.

In one exemplary scenario, the apparatus 10 further includes four lowfrequency (LF) cameras 30, i.e. cameras of a relatively low imagecapture frequency (say of 25 Hz), positioned roughly over the fourcorners of a tennis court 34. Each the LF cameras 30 faces the court's34 opposite half, with a tilt angle which is slightly lower thanhorizontal, such that the camera's field of view extends into thecourt's 34 half at the other side of the court 34, as described infurther detail hereinbelow.

The image receiver 11 receives 21 the images captured in the relativelylow frequency, as described in further detail hereinabove.

In the example, during a tennis match, the motion tracker 12 tracks 22the motions of the first object (say a ball) through an analysis of theimages captured by the LF cameras 30, in an image capture frequency of25 Hz, i.e. with an accuracy of 40 milliseconds.

In the example, the apparatus 10 further includes several high frequency(HF) cameras 31, i.e. cameras of a higher image capture frequency (sayof 2,000 Hz), positioned at different parts of the tennis court 34. Eachthe HF cameras 31 has a different field of view, which covers arespective, different part of the tennis court 34, say an area around anet, a specific segment of a borderline of the tennis court, etc., asdescribed in further detail hereinbelow.

When the ball arrives at a predefined positional relation to a secondobject (say a borderline), say into a distance of less than two metersfrom a segment of the borderline, the arrival is detected 23 by theposition relation tracker 13. Consequently, the camera controller 14automatically selects among the HF cameras 31, a camera which bestcovers the borderline segment approached by the ball (as evident fromthe ball's trajectory). Then, the camera controller 14 automaticallyinitiates 24 a streaming of images captured in a 2,000 Hz capturefrequency—i.e. with an accuracy of half a millisecond, from the selectedcamera.

Consequently, there is enabled, a capturing of the image of the ball'slanding at a point within the court's 34 borderlines or out of thecourt's 34 borderlines, next to the borderline segment, at the exacthalf a millisecond of the ball's landing.

As a result, the ball's landing is more clearly determinable as an “out”or an “in” landing, than when having to rely on the LF camera's 30 imagecapture frequency of 25 Hz—i.e. with an accuracy of 40 millisecondsonly.

Further, a close up video of the landing which clearly shows that “in”or “out” landing of the ball, may be automatically generated andbroadcasted, as described in further detail hereinabove.

Thus, with the exemplary embodiment, interesting events such as an outevent are streamed in the higher frequency—which provides for greateraccuracy, but involves the processing of a greater number of images perminute, and is thus likely to be heavier in terms of bandwidthconsumption, data storage, etc.

However, the images of the remaining parts of the sport event arecaptured and received in the lower frequency—which is less accurate, butlighter in terms of bandwidth consumption, data storage, etc.

Consequently, there may be provided a potentially optimal usage of thecameras of different image capture frequencies, as described in furtherdetail hereinabove.

Reference is now made to FIG. 3B which is a block diagram schematicallyillustrating a second exemplary scenario of controlling cameras in asport event, according to an exemplary embodiment of the presentinvention.

In a second exemplary scenario, the apparatus 10 further includes fourlow resolution (LR) cameras 32, i.e. cameras of a relatively low imagecapture resolution (say of a 2048×1080 resolution), positioned roughlyover the four corners of a soccer field 35. Each the LR cameras 32 facesthe field's 35 opposite half, with a tilt angle which is slightly lowerthan horizontal, such that the camera's field of view extends into thefield's 35 half at the other side of the field 35.

The image receiver 11 receives 21 the images captured in the relativelylow resolution, as described in further detail hereinabove.

In the example, during a game of soccer, the motion tracker 12 tracks 22the motions of the first object (say a first player when running with aball), in real time (or near real time) through an analysis of theimages captured by the LR cameras 32, in the 2048×1080 resolution.

In the example, the apparatus 10 further includes several highresolution (HR) cameras 33, i.e. cameras of a higher image captureresolution (say of a 15360×8640 resolution), positioned at differentparts of the soccer field 35. Each the HR cameras 33 has a differentfield of view, which covers a respective, different part of the field35, say an area around one of the gates, a borderline segment, etc.

A second player (i.e. a second object) approaches the first player (withan intention to kick the first player), say into a distance of less thanone meter from the first player, thus putting the first player in apredefined positional relation to the second player—namely, into adistance of less than one meter from the second player.

Consequently, the arrival of the first player into the predefinedpositional relation is detected 23 by the position relation tracker 13.

Upon the detection 23, the camera controller 14 automatically selectsamong the HR cameras 33, a camera which best covers the area around thetwo players. Then, the camera controller 14 automatically initiates 24 astreaming of images captured in the higher resolution (i.e. in the15360×8640 resolution)—, from the selected camera.

Consequently, there is enabled, a capturing of the image of the secondplayer when apparently, kicking the first player, at the 15360×8640resolution which is significantly higher than the 2048×1080 resolutionof the LR cameras 32, which higher resolution shows the event in finerdetail.

As a result, the apparent kicking may be more clearly determinable as a“Foul”, than when having to rely on the LR camera's 32 image of thelower resolution.

Further, a close up video of the kicking, which clearly shows the“Foul”, may be automatically generated and broadcasted, as described infurther detail hereinabove.

Consequently, there may be provided a potentially optimal usage of thecameras of different image capture frequencies, as described in furtherdetail hereinabove.

Reference is now made to FIG. 4, which is a block diagram schematicallyillustrating an exemplary computer readable medium storing computerexecutable instructions for performing steps of controlling cameras in asport event, according to an exemplary embodiment of the presentinvention.

According to an exemplary embodiment of the present invention, there isprovided a non-transitory computer readable medium 40, such as a CD-ROM,a USB-Memory, a Portable Hard Disk, etc.

The computer readable medium 40 stores computer executable instructions,for performing steps of controlling cameras in a sport event. Theinstructions may be executed upon one or more computer processors.

The computer executable instructions include a step of receiving 41images of a sport event taking place in a constrained environment (say afootball field, a soccer field, a tennis court, etc.), during the sportevent, as described in further detail hereinabove.

The received 41 images are captured with a first pixels to time ratio,say in a first image capture frequency and resolution, say by one ormore cameras positioned around the constrained environment, as describedin further detail hereinabove.

The computer executable instructions further include a step of tracking42 a motion of a first object (say a ball or a player), using thereceived 41 images which are captured with the first pixels to timeratio, during the receiving 41, as the sport event progresses (i.e. inreal time), as described in further detail hereinabove.

Optionally, the motion is tracked 42 through a stereoscopic or otheranalysis of the images captured with the first pixels to time ratio,which analysis yields a three dimensional (3D) space which representsthe constrained environment, as known in the art.

The computer executable instructions further include a step of detecting43 an arrival of the first object (say the ball) into a predefinedpositional relation to a second object (say a tennis net, a soccer gate,etc.), during the tracked 42 motion, as described in further detailhereinabove.

Optionally, the detection 43 is based at least on a predefined change inshape of one of the objects (say a squeezing of the ball when the ballhits a racket or a wall, a change in shape of a net, etc.).

Optionally, the detection 43 is based at least on a distance between theobjects.

In one example, there is detected 43 an arrival of a ball at a distanceof below a predefined threshold from a net positioned in the middle of aconstrained environment such as a tennis court, or from a borderline,during a tracked 42 motion of the ball, away from a player who strikesthe ball with a racket.

In another example, the there is detected 43 an arrival of a ball at adistance of above a predefined threshold from a player, during a tracked42 motion of the ball, away from a player who kicks the ball.

In yet another example, there is detected 43 an arrival of a player at adistance of below a predefined threshold from a football gate, duringthe player's motion, as tracked 42.

The computer executable instructions further include a step of, upon thedetection 43 of the arrival into the positional relation, automaticallyinitiating 44 a streaming of images captured with a pixels to time ratiohigher than the first pixels to time ratio by one or more selectedcamera(s), as described in further detail hereinabove.

Optionally, the computer executable instructions further include a stepof selecting the one or more camera(s) among a set which consists of twoor more cameras, as described in further detail hereinabove.

Optionally, the selection is based on a rule predefined by an operator,an administrator, a developer (say programmer) of the instructionsstored on the computer readable medium 40, etc.

Optionally, the one or more camera(s) are selected based on a trajectoryof the tracked 42 motion of the first object and on the field of view ofthe selected camera(s), as described in further detail hereinabove.

Each of the cameras in the set has a different field of view, which maycover a respective, different part of the constrained environment, sayan area around a net deployed on a volleyball court, a specific segmentof a borderline of a tennis court, an areas around a soccer gate, etc.

Optionally, the selected cameras include two or more cameras deployed atdifferent sides of a same part of the constrained environment (say onthe two ends of the net, or of a borderline segment), with a different(say opposite) field of view on the same part, as described in furtherdetail hereinbelow.

Optionally, the computer executable instructions further include a stepof initiating a change in a field of view of the selected camera(s), sayby triggering an actuator, as described in further detail hereinabove.

Consequently, the actuator moves the selected camera, in a rotationalmotion or in a motion which involves a change of tilt, so as to capturea part of the constrained environment, approached by the first object.

In one example, the selected camera is a camera also used to capture atleast some of the received 41 images captured with the first pixels totime ratio, and the computer executable instructions further include astep of changing the selected camera's pixels to time ratio. Forexample, the computer executable instructions may include a step ofchanging the selected camera's image capture frequency, resolution, orboth the frequency and the resolution, to a higher value, upon thedetected 43 arrival. That is to say that in the example, the same camerais used to capture images with both pixels to time ratios.

Optionally and subsequently, the computer executable instructionsfurther include a step of stopping the streaming of the images capturedin the pixels to time ratio higher than the first pixels to time ratio,from the selected cameras.

Optionally, the streaming is stopped according to a predefined rule, asdescribed in further detail hereinbelow.

In one example, the streaming is stopped after a predefined number ofseconds after the detected 43 arrival into the positional relation, upona subsequent detection of an end of the event (say a crossing of theball into an area within a soccer gate or a few seconds after thatcrossing), etc., as described in further detail hereinabove.

Optionally, the computer executable instructions further include a stepof combining at least a few of the images captured with the first pixelsto time ratio and at least a few of the images captured with the higherpixels to time ratio, for creating a zooming effect into a part of theconstrained environment covered by a field of view of the selectedcamera(s).

The improved accuracy resultant upon the higher pixels to time ratio,may thus allow a broadcaster, to provide spectators of a sport event,say on a TV channel or a web site, with a close up video which createsan effect of zooming into the part of the constrained environmentcovered by the field of view of the selected camera(s), as described infurther detail hereinabove.

Consequently, there may be clearly shown the event of interest, say anout event, captured in the higher pixels to time ratio, in the close upvideo. The video may thus potentially remove doubts as to properofficiating (say as to a decision made by an official such as a linejudge during a football match or a tennis match).

With the executed instructions, throughout the sport event, there istracked 42 the motion of objects in the constrained environment.

Optionally, the tracking 42 of the motion of objects (say the firstobject) is based on a derivation of a position of the objects in athree-dimensional space representing the constrained environment, fromboth the images captured with the first pixels to time ratio and theimages captured with the higher pixels to time ratio.

For example, the derivation of the position of the first object usingthe images captured in the higher pixels to time ratio, may be based ona stereoscopic analysis of images captured with the higher pixels totime ratio, by the selected cameras, say by two cameras positioned atopposite sides of the part of the constrained environment covered by afield of view of the selected cameras.

Alternatively, the tracking 42 of the motion of objects (say the firstobject) is based on a derivation of the position of the first object ina three-dimensional space which represents the constrained environment,from the images captured only with the first pixels to time ratio.

In one example, the streamed images are captured with the higher pixelsto time ratio, by a single camera which covers the part of theconstrained environment approached by the first object, and the streamed44 images captured with the higher pixels to time ratio are used forgenerating a close up video. However, in the example, the streamed 44images, captured with the higher pixels to time ratio, are not used forthe derivation of the position of the first object, and not for thetracking 42 of the motion, as described in further detail hereinabove.

Further Discussion

An exemplary method for enabling analysis of events, comprises thefollowing steps: receiving a stream of multimedia data of a real sportgame in real time (RT) from a camera in a low frequency mode, analyzingthe stream of multimedia data to identify an action which signifies abeginning of event by comparison to predefined templates, and tracking amotion of a first object by the camera in the low frequency mode.

The exemplary method further includes subsequent steps of predicting anevent that may affect a score in the real sport game, identifying a highfrequency camera having the best field of view to the second object thatthe first object is approaching to, activating the identified highfrequency camera to transmit a stream of multimedia data, anddetermining an end of high frequency camera activation, according topredefined rules.

Reference is now made to FIG. 5, which is a top 3D perspective diagramof a system incorporated in a real sport environment, according to anexemplary embodiment of the present invention.

Embodiments of the present invention provide a smart-court system foridentifying a location of an object (say a falling ball) in a real sportgame, in real time (RT) or in near real time.

In a non-limiting example, the system illustrated in FIG. 5, may beincorporated in a tennis court, in which a real player 110 may betraining or competing against a real player 140 with a tennis ball 120,say in a tennis match.

The system may include one or more computers 150, on which one or moremodules are implemented as software, hardware, or a combination ofsoftware and hardware, as described in further detail hereinbelow.

The one or more computers 150 may also include additional modules, sayfor broadcasting the sport event to a remote audience, say on TVchannels, or over a wide area network 180 such as the internet, tocomputers and smart phones 170, in use by members of the remoteaudience.

The system may identify in RT or near RT, during the real sport game,the location of an object. In one example, there may be calculated theexact contact area that the ball makes with the court during a bounceaction.

According to one aspect of the present invention, relatively lowfrequency cameras 130 may be utilized to monitor a 3D area within andaround the game's court, for capturing and tracking motion of the realtennis players 110 and 140 and the ball 120.

According to some embodiments of the present invention, there isprovided a data processing system that is connected to multiple lowfrequency cameras 130 and multiple high frequency cameras 190.

In a non limiting example, the low frequency cameras 130 have a 50 Hzfrequency, and the high frequency cameras 190 have a 1000 Hz frequency(the frequency representing the image capture rate).

Using only the high frequency cameras 190 for capturing large portionsof the games, results in large data files which require big data storageequipment and strong processing units.

The data processing system may be arranged to receive a stream ofmultimedia data of a RT or near RT sport session from the multiple lowfrequency cameras 130—in a non limiting example, from four low frequencycameras 130, and to determine when an event is about to occur.

The high frequency cameras 190 capture images along the lines of thecourt, with a horizontal, narrow viewing angle.

Alternatively, according to some embodiments, instead of a plurality ofhigh frequency cameras 190 and a plurality of low frequency cameras 130,as described hereinabove, there may be used a plurality of cameras whichoperate on a low frequency mode and when receiving a certain signal,switch to a higher frequency mode.

This type of cameras, that operate on a low frequency mode and on a highfrequency mode, according to demand, may have a wider angle of photo andthe direction of their transmission of multimedia file to the system maybe controlled.

According to some embodiments of the present invention, when the dataprocessing system identifies that an event such an out event, is aboutto occur, using the low frequency cameras 130, the data processingsystem selects a high frequency camera out of the multiple highfrequency cameras 190, and activates the high frequency camera.

Alternatively, the data processing system may send a signal to a camerathat operates in low frequency to change mode to high frequency mode anddetermine a shooting angle.

The activated high frequency camera 190 may forward multimedia data tothe data processing system, for analysis of the event.

The number of cameras may be related to the range of covering of photoof each camera. In a non limiting example, the number of high frequencycameras 190 may be fourteen, as illustrated using FIG. 3A, hereinabove.

Optionally, in order to identify an event during the real sport game,the system may perform an analysis by receiving a stream of a multimediadata of the real sport game in RT, from the low frequency cameras 130.The analysis may be for identifying when an event is about to occur anddetermine which one of the high frequency cameras 190 should beactivated, for capturing the event.

After an event is identified, the system may stop the activation of thehigh frequency camera. The forwarding of the multimedia data from thecamera to the system may be stopped: (i) arbitrarily, after a predefinedperiod; or (ii) after a predefined amount of time after identificationthat a ball 120 touches an object.

An event that the system may identify may be related to an object in thereal sport game.

In a non limiting example, an event may span the exact contact pointswhich the ball 120 makes with a tennis court during bouncing, whichcontact points may be used for determining if the ball 120 touches theline or not.

In another non limiting example, an event may be when a ball 120 hits ortouches the net or alternatively when the ball 120 fails to hit or touchthe net.

In yet another non limiting example, an event may be when a ball 120hits a racket.

In yet another non limiting example, in case the ball 120 doesn't touchthe line when the ball 120 bounces, the system may calculate thedistance between the point where the ball 120 hits the ground and theline.

Reference is now made to FIG. 6, which is a flowchart illustratingactivity of an event prediction module, according to an exemplaryembodiment of the present invention.

According to another aspect of the present invention, an eventprediction module of the exemplary data processing system, may receive astream of multimedia data of a real sport game in Real Time (RT) fromthe low frequency cameras 130 illustrated in FIG. 5 (stage 205).

The received stream of multimedia data may be analyzed by the eventprediction module, to identify a ball's 120 strike by comparison topredefined templates (stage 210). After the strike is identified, aballistic motion of the ball 120 may be tracked by the low frequencycameras 130 (stage 215).

According to another aspect of the present invention, the eventprediction module may analyze the motion and the activities of thetracked ball 120 with event templates which define a set of predefinedparameters related to motion of the ball 120 (stage 220).

Based on the analysis of the motion of the ball 120, the predictedtrajectory of the ball 120 may be calculated, in order to predict anevent with a potential to affect the score of the real sport game, suchas the ball's 120 approaching to an object which is at least one of: (i)a line; (ii) a net; and (iii) a racket (stage 225), and to identify ahigh frequency camera 190 that is closest to an object approached by theball (stage 230).

Then, there is selected and activated the identified high frequencycamera 190 which is closest to the object approached by the ball 120(stage 235).

The end of the high frequency camera's activation may be determinedaccording to predefined rules such as a predefined period of time oraccording to identification of the ball's 120 touching an object by theevent prediction module (stage 240). Thus, the end of selected highfrequency camera's activation may be arbitrary, after a predefinedamount of time, or alternatively, after a predefined of time after theend of event.

Reference is now made to FIG. 7, which is a flowchart illustratingactivity of high frequency frame analysis module, according to anexemplary embodiment of the present invention.

According to another aspect of the present invention, a high frequencyframe analysis module of the exemplary data processing system, mayreceive a sequence of frames (i.e. images) from a high frequency camera(stage 310).

In the sequence of frames, the high frequency frame analysis module maysearch for specific characteristics of a target object in the receivedframes (stage 315).

For example, if the target object is a ball 120, the characteristic thatis searched for, may be a change in the ball's 120 shape (i.e. that theball 120 is squeezed), when the ball 120 touches the ground.Alternatively, the shadow of the ball 120 may be tracked and analyzed toidentify if the ball 120 touches the ground.

In another example, the target object is a net, and when the ball 120touches the net, the movement of the net is identified.

When the part of the sequence of frames, which involves the event (i.e.the characteristic is searched for), is identified, the high frequencyframe analysis module may write the first and the last frames of thatpart, to an event log.

Optionally, the part of the sequence of frames which involves the eventfirst and last frames is extended to include the frames where thecharacteristics of the target object are identified plus a predefinedamount of frames before and a predefined amount of frames after theframes which involve the event (stage 320).

The frames in the part of the sequence are analyzed for determiningoccurrence of a type of event which may affect the score in the realsport game (stage 325). For example, an event which may affect thescore, may be determined to occur when the ball 120 touches the groundwithin the game boundaries (called ‘IN’), or when the ball 120 touchesthe boundaries' line or when the ball 120 falls outside the boundaries(called ‘OUT’).

Reference is now made to FIG. 8, which is a flowchart illustratingactivity of embedding frames in a multimedia stream module, according toan exemplary embodiment of the present invention.

According to another aspect of the present invention, a module of theexemplary data processing system, for embedding frames in a multimediastream, may receive several frames in the high frequency from the highfrequency frame analysis module, as described in further detailhereinabove (stage 410).

According to another aspect of the present invention, the first and lastreceived frames which relate to the identified event from the severalframes in high frequency, may be identified (stage 415).

According to another aspect of the present invention, the identifiedhigh frequency frames are integrated with the low frequency frames forcreating a video showing a zoom effect at the identified event (stage420).

According to another aspect of the present invention, after the framesin the high frequency are integrated with the low frequency frames, thusembedding the high frequency frames in the video showing the zoomeffect, which is made primarily of the low frequency frames, the videomay be broadcast online (stage 425).

In the above description, an embodiment is an example or implementationof the invention. The various appearances of “one embodiment”, “anembodiment” or “some embodiments” do not necessarily all refer to thesame embodiments.

Although various features of the invention may be described in thecontext of a single embodiment, the features may also be providedseparately or in any suitable combination. Conversely, although theinvention may be described herein in the context of separate embodimentsfor clarity, the invention may also be implemented in a singleembodiment.

Furthermore, it is to be understood that the invention can be carriedout or practiced in various ways and that the invention can beimplemented in embodiments other than the ones outlined in thedescription above.

The invention is not limited to those diagrams or to the correspondingdescriptions. For example, flow need not move through each illustratedbox or state, or in exactly the same order as illustrated and described.

Meanings of technical and scientific terms used herein are to becommonly understood as by one of ordinary skill in the art to which theinvention belongs, unless otherwise defined.

It is expected that during the life of this patent many relevant devicesand systems will be developed and the scope of the terms herein,particularly of the terms “Computer”, “Processor” “Camera”, “Network”,“Video”, “TV Channel”, “Website”, “USB-Memory”, “Portable Hard Disk” and“CD-ROM”, is intended to include all such new technologies a priori.

It is appreciated that certain features of the invention, which are, forclarity, described in the context of separate embodiments, may also beprovided in combination in a single embodiment. Conversely, variousfeatures of the invention, which are, for brevity, described in thecontext of a single embodiment, may also be provided separately or inany suitable subcombination.

Although the invention has been described in conjunction with specificembodiments thereof, it is evident that many alternatives, modificationsand variations will be apparent to those skilled in the art.Accordingly, it is intended to embrace all such alternatives,modifications and variations that fall within the spirit and broad scopeof the appended claims.

All publications, patents and patent applications mentioned in thisspecification are herein incorporated in their entirety by referenceinto the specification, to the same extent as if each individualpublication, patent or patent application was specifically andindividually indicated to be incorporated herein by reference. Inaddition, citation or identification of any reference in thisapplication shall not be construed as an admission that such referenceis available as prior art to the present invention.

What is claimed is:
 1. A method for controlling cameras in a sportevent, the method comprising steps executed by at least one computer,the steps comprising: during a sport event taking place in a constrainedenvironment, receiving images of the sport event, the images beingcaptured with a first pixels to time ratio; tracking a motion of a firstobject, using the images being captured with the first pixels to timeratio; detecting an arrival of the first object into a predefinedpositional relation to a second object, during the tracked motion; andupon the detected arrival, initiating streaming of images being capturedwith a pixels to time ratio higher than the first pixels to time ratioby at least one selected camera.
 2. The method of claim 1, wherein thehigher pixels to time ratio is resultant upon a higher resolution ofimage capture by the at least selected one camera.
 3. The method ofclaim 1, wherein the higher pixels to time ratio is resultant upon ahigher frequency of image capture by the at least one selected camera.4. The method of claim 1, further comprising deriving a position of thefirst object in a three-dimensional space representing the constrainedenvironment from the images captured only with the first pixels to timeratio, for said tracking.
 5. The method of claim 1, further comprisingderiving a position of the first object in a three-dimensional spacerepresenting the constrained environment from the images captured withthe first pixels to time ratio and the images being captured with thehigher pixels to time ratio, for said tracking.
 6. The method of claim1, further comprising selecting the at least one camera among aplurality of cameras, based on a predefined rule.
 7. The method of claim1, further comprising selecting the at least one camera among aplurality of cameras, each of the cameras having a different field ofview covering a respective part of the constrained environment, saidselecting being based on a trajectory of the tracked motion of the firstobject and on the field of view of the selected camera.
 8. The method ofclaim 1, wherein the selected cameras comprise at least two camerasdeployed at different sides of a same part of the constrainedenvironment, and having different fields of view on the same part. 9.The method of claim 1, further comprising combining at least a few ofthe images captured with the first pixels to time ratio and at least afew of the images captured with the higher pixels to time ratio, forcreating an effect of zooming into a part of the constrained environmentcovered by a field of view of the selected camera.
 10. The method ofclaim 1, further comprising changing a pixels to time ratio of theselected camera to the higher pixels to time ratio, upon the detectedarrival, wherein the selected camera is a camera also used to capture atleast some of the images captured with the first pixels to time ratio.11. The method of claim 1, wherein said detecting of the arrival of thefirst object into the predefined positional relation to the secondobject, is based at least on a predefined change in shape of one of theobjects.
 12. The method of claim 1, wherein said detecting of thearrival of the first object into the predefined positional relation tothe second object, is based at least on a distance between the objects.13. The method of claim 1, further comprising stopping the streaming ofthe images being captured with the pixels to time ratio higher than thefirst pixels to time ratio, according to a predefined rule.
 14. Themethod of claim 1, further comprising initiating a change in field ofview of the selected camera.
 15. An apparatus for controlling cameras ina sport event, the apparatus comprising: a computer; an image receiver,implemented on the computer, and configured to receive images of a sportevent taking place in a constrained environment, during the sport event,the images being captured with a first pixels to time ratio; a motiontracker, in communication with said image receiver, configured to tracka motion of a first object, using the images being captured with thefirst pixels to time ratio; a positional relation detector, incommunication with said motion tracker, configured to detect an arrivalof the first object into a predefined positional relation to a secondobject, during the tracked motion; and a camera controller, incommunication with said positional relation detector, configured toinitiate streaming of images being captured with a pixels to time ratiohigher than the first pixels to time ratio by at least one selectedcamera, upon the detected arrival.
 16. The apparatus of claim 15,further comprising a camera selector, in communication with said cameracontroller, configured to select the at least one camera among aplurality of cameras, based on a predefined rule.
 17. The apparatus ofclaim 15, further comprising a camera selector, in communication withsaid camera controller, configured to select the at least one cameraamong a plurality of cameras, based on a trajectory of the trackedmotion of the first object and on the field of view of the selectedcamera, each of the cameras having a different field of view covering arespective part of the constrained environment.
 18. The apparatus ofclaim 15, wherein the selected cameras comprise at least two camerasdeployed at different sides of a same part of the constrainedenvironment, and having different fields of view on the same part. 19.The apparatus of claim 15, further comprising an image combiner, incommunication with said camera controller, configured to combine atleast a few of the images captured with the first pixels to time ratioand at least a few of the images captured with the higher pixels to timeratio, for creating an effect of zooming into a part of the constrainedenvironment covered by a field of view of the selected camera.
 20. Theapparatus of claim 15, wherein said camera controller is furtherconfigured to change a pixels to time ratio of the camera to the higherfrequency, upon the detected arrival, wherein the selected camera is acamera also used to capture at least some of the images captured withthe first pixels to time ratio.
 21. The apparatus of claim 15, whereinsaid positional relation detector is further configured to base thedetecting of the arrival of the first object into the predefinedpositional relation to the second object, at least on a predefinedchange in shape of one of the objects.
 22. The apparatus of claim 15,wherein said positional relation detector is further configured to basethe detecting of the arrival of the first object into the predefinedpositional relation to the second object, at least on a distance betweenthe objects.
 23. The apparatus of claim 15, wherein said cameracontroller is further configured to stop the streaming of the imagesbeing captured with the pixels to time ratio higher than the firstpixels to time ratio, according to a predefined rule.
 24. The apparatusof claim 15, wherein said camera controller is further configured toinitiate a change in field of view of the camera.
 25. A non-transitorycomputer readable medium storing computer executable instructions forperforming steps of controlling cameras in a sport event, the stepscomprising: during a sport event taking place in a constrainedenvironment, receiving images of the sport event, the images beingcaptured with a first pixels to time ratio; tracking a motion of a firstobject, using the images being captured with the first pixels to timeratio; detecting an arrival of the first object into a predefinedpositional relation to a second object during the tracked motion; andupon the detected arrival, initiating streaming of images being capturedwith a pixels to time ratio higher than the first pixels to time ratioby at least one selected camera.